path selection
Reinforcement Learning-based Adaptive Path Selection for Programmable Networks
Torres, José Eduardo Zerna, Avgeris, Marios, Papagianni, Chrysa, Pongrácz, Gergely, Gódor, István, Grosso, Paola
This work presents a proof-of-concept implementation of a distributed, in-network reinforcement learning (IN-RL) framework for adaptive path selection in programmable networks. By combining Stochastic Learning Automata (SLA) with real-time telemetry data collected via In-Band Network Telemetry (INT), the proposed system enables local, data-driven forwarding decisions that adapt dynamically to congestion conditions. The system is evaluated on a Mininet-based testbed using P4-programmable BMv2 switches, demonstrating how our SLA-based mechanism converges to effective path selections and adapts to shifting network conditions at line rate.
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Europe > Hungary > Budapest > Budapest (0.04)
Adaptive Context-Aware Multi-Path Transmission Control for VR/AR Content: A Deep Reinforcement Learning Approach
Ahmed, Shakil, Sabuj, Saifur Rahman, Khokhar, Ashfaq
These authors present a few critical features for ACMPTC to enhance applications require high bandwidth, ultra-low latency, and its performance--mainly choosing paths with low latency and consistent quality of service (QoS) to deliver seamless, immersive packet loss. It brings a DRL-based agent that can adapt its experiences [2]. Traditional network protocols like the decision to real-time network states and compute dynamic, Transmission Control Protocol (TCP) often struggle to meet optimal choices. This feedback loop, on the other hand, these stringent demands, especially in highly dynamic and allows for real-time path selection and resource allocation that diverse network environments due to single path transmission, enables continuous optimization to provide a smooth AR/VR inadequate for high-bandwidth, low-latency requirement, high experience even with varying network conditions. It confirms latency sensitivity, etc. [3]. These limitations make TCP less that the system operates correctly and provides a way to update effective for dynamic, heterogeneous network environments such a network when there is variation in traffic levels by and the demanding performance needs of modern applications adjusting it effectively.
- North America > United States > Iowa > Story County > Ames (0.04)
- Asia > Malaysia (0.04)
- Asia > Bangladesh (0.04)
- Energy (0.93)
- Telecommunications > Networks (0.71)
Learning the Optimal Path and DNN Partition for Collaborative Edge Inference
Huang, Yin, Zhang, Letian, Xu, Jie
Recent advancements in Deep Neural Networks (DNNs) have catalyzed the development of numerous intelligent mobile applications and services. However, they also introduce significant computational challenges for resource-constrained mobile devices. To address this, collaborative edge inference has been proposed. This method involves partitioning a DNN inference task into several subtasks and distributing these across multiple network nodes. Despite its potential, most current approaches presume known network parameters -- like node processing speeds and link transmission rates -- or rely on a fixed sequence of nodes for processing the DNN subtasks. In this paper, we tackle a more complex scenario where network parameters are unknown and must be learned, and multiple network paths are available for distributing inference tasks. Specifically, we explore the learning problem of selecting the optimal network path and assigning DNN layers to nodes along this path, considering potential security threats and the costs of switching paths. We begin by deriving structural insights from the DNN layer assignment with complete network information, which narrows down the decision space and provides crucial understanding of optimal assignments. We then cast the learning problem with incomplete network information as a novel adversarial group linear bandits problem with switching costs, featuring rewards generation through a combined stochastic and adversarial process. We introduce a new bandit algorithm, B-EXPUCB, which combines elements of the classical blocked EXP3 and LinUCB algorithms, and demonstrate its sublinear regret. Extensive simulations confirm B-EXPUCB's superior performance in learning for collaborative edge inference over existing algorithms.
- Information Technology > Security & Privacy (0.86)
- Education > Focused Education > Special Education (0.44)
- Information Technology > Communications > Networks (1.00)
- Information Technology > Data Science > Data Mining > Big Data (0.87)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
DynamicRouteGPT: A Real-Time Multi-Vehicle Dynamic Navigation Framework Based on Large Language Models
Zhou, Ziai, Zhou, Bin, Liu, Hao
Real-time dynamic path planning in complex traffic environments presents challenges, such as varying traffic volumes and signal wait times. Traditional static routing algorithms like Dijkstra and A* compute shortest paths but often fail under dynamic conditions. Recent Reinforcement Learning (RL) approaches offer improvements but tend to focus on local optima, risking dead-ends or boundary issues. This paper proposes a novel approach based on causal inference for real-time dynamic path planning, balancing global and local optimality. We first use the static Dijkstra algorithm to compute a globally optimal baseline path. A distributed control strategy then guides vehicles along this path. At intersections, DynamicRouteGPT performs real-time decision-making for local path selection, considering real-time traffic, driving preferences, and unexpected events. DynamicRouteGPT integrates Markov chains, Bayesian inference, and large-scale pretrained language models like Llama3 8B to provide an efficient path planning solution. It dynamically adjusts to traffic scenarios and driver preferences and requires no pre-training, offering broad applicability across road networks. A key innovation is the construction of causal graphs for counterfactual reasoning, optimizing path decisions. Experimental results show that our method achieves state-of-the-art performance in real-time dynamic path planning for multiple vehicles while providing explainable path selections, offering a novel and efficient solution for complex traffic environments.
- North America > United States > District of Columbia > Washington (0.05)
- Asia > China > Zhejiang Province > Hangzhou (0.05)
- Asia > China > Shandong Province > Qingdao (0.04)
- (5 more...)
- Research Report > Promising Solution (0.66)
- Research Report > New Finding (0.66)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Ground > Road (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- (3 more...)
Robust Path Selection in Software-defined WANs using Deep Reinforcement Learning
Pouryousef, Shahrooz, Gao, Lixin, Towsley, Don
In the context of an efficient network traffic engineering process where the network continuously measures a new traffic matrix and updates the set of paths in the network, an automated process is required to quickly and efficiently identify when and what set of paths should be used. Unfortunately, the burden of finding the optimal solution for the network updating process in each given time interval is high since the computation complexity of optimization approaches using linear programming increases significantly as the size of the network increases. In this paper, we use deep reinforcement learning to derive a data-driven algorithm that does the path selection in the network considering the overhead of route computation and path updates. Our proposed scheme leverages information about past network behavior to identify a set of robust paths to be used for multiple future time intervals to avoid the overhead of updating the forwarding behavior of routers frequently. We compare the results of our approach to other traffic engineering solutions through extensive simulations across real network topologies. Our results demonstrate that our scheme fares well by a factor of 40% with respect to reducing link utilization compared to traditional TE schemes such as ECMP. Our scheme provides a slightly higher link utilization (around 25%) compared to schemes that only minimize link utilization and do not care about path updating overhead.
- North America > United States > Virginia (0.04)
- North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Majumdar, Arjun, Shrivastava, Ayush, Lee, Stefan, Anderson, Peter, Parikh, Devi, Batra, Dhruv
Following a navigation instruction such as'Walk down the stairs and stop at the brown sofa' requires embodied AI agents to ground scene elements referenced via language (e.g.'stairs') to visual content in the environment (pixels corresponding to'stairs'). We ask the following question - can we leverage abundant'disembodied' web-scraped vision-and-language corpora (e.g. Conceptual Captions [24]) to learn visual groundings (what do'stairs' look like?) that improve performance on a relatively data-starved embodied perception task (Visionand-Language Navigation)? Specifically, we develop VLN-BERT, a visiolinguistic transformer-based model for scoring the compatibility between an instruction ('...stop at the brown sofa') and a sequence of panoramic RGB images captured by the agent. We demonstrate that pretraining VLN-BERT on image-text pairs from the web before fine-tuning on embodied path-instruction data significantly improves performance on VLN - outperforming the prior state-of-the-art in the fully-observed setting by 4 absolute percentage points on success rate. Ablations of our pretraining curriculum show each stage to be impactful - with their combination resulting in further positive synergistic effects.
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.34)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.34)
Unsupervised Pivot Translation for Distant Languages
Leng, Yichong, Tan, Xu, Qin, Tao, Li, Xiang-Yang, Liu, Tie-Yan
Unsupervised neural machine translation (NMT) has attracted a lot of attention recently. While state-of-the-art methods for unsupervised translation usually perform well between similar languages (e.g., English-German translation), they perform poorly between distant languages, because unsupervised alignment does not work well for distant languages. In this work, we introduce unsupervised pivot translation for distant languages, which translates a language to a distant language through multiple hops, and the unsupervised translation on each hop is relatively easier than the original direct translation. We propose a learning to route (LTR) method to choose the translation path between the source and target languages. LTR is trained on language pairs whose best translation path is available and is applied on the unseen language pairs for path selection. Experiments on 20 languages and 294 distant language pairs demonstrate the advantages of the unsupervised pivot translation for distant languages, as well as the effectiveness of the proposed LTR for path selection. Specifically, in the best case, LTR achieves an improvement of 5.58 BLEU points over the conventional direct unsupervised method.
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > Texas > Dallas County > Dallas (0.04)
- Europe > Germany > Berlin (0.04)
- (2 more...)
Smart Mobile Robots are Everywhere
In 2012 Kiva, a supplier of mobile robots, was bought for $775 million by Amazon. That was a home run for the company which had estimated revenues of about $100 million at the time. Some entrepreneurs and venture capitalists believe, or at least argue, that autonomous mobile robots represent the next revolution in material handling. Automatic guided vehicles carry goods from one point to another in a warehouse or factory on predetermined paths. In contrast, autonomous mobile robots may have a preferred path to get from A to B, but can take alternative routes if there are obstacles or congestion.